Computational Tools and Resources in Plant Genome Informatics

Authors

  • Todd J. Vision
  • Aoife McLysaght
Abstract

Though all biologists deal with information, only recently have the computational challenges of systematically collecting, storing, organising, manipulating, visualising and analysing large amounts of biological information come to be widely appreciated. The cause of this is the explosive growth of genomics. The term bioinformatics was originally coined for the application of information technology to large volumes of biological, and particularly genomic, data. The field of bioinformatics has since become intermingled with traditional computational biology and biostatistics, which are strictly concerned not with how to handle the information itself, but rather with how to extract biological meaning from it. Thus, bioinformatics, in its broad sense, can be seen as providing both the infrastructure and the scientific framework in which biologists take information and use computers to help convert it into knowledge. Despite the relative youth of the field as a recognised discipline, an impressive diversity of bioinformatics resources is currently available. By necessity, we focus on only a small slice of this diversity here. We pay particular attention to sequence analysis because of its centrality to genomics. We do not attempt to provide specific protocols, as the needs of users vary greatly. The resources we describe range widely in sophistication, from little-tested programs posted on graduate student web pages to very stable and complex databases maintained by governmental agencies. The better ones typically provide manuals and tutorials, often containing descriptions of the underlying principles. The reader is strongly advised to consult the documentation available for each tool. Though a wide array of commercial resources exists, some of which are ideally suited to specific tasks, many of the most fundamental and long-lived bioinformatics tools are freely available. For this reason, we describe primarily non-commercial software in this chapter.
Many of the databases and analysis tools we describe are hosted by government or academic research centres and can be accessed via user-friendly web interfaces. Tables 4.1 and 4.2 list the Uniform Resource Locators (URLs) for all the online resources that are discussed in the text.

Similar resources

From plant genomes to protein families: computational tools

The development of new high-throughput sequencing technologies has increased dramatically the number of successful genomic projects. Thus, draft genomic sequences of more than 60 plant species are currently available. Suitable bioinformatics tools are being developed to assemble, annotate and analyze the enormous number of sequences produced. In this context, specific plant comparative genomic ...

RepeatExplorer: a Galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads

MOTIVATION Repetitive DNA makes up large portions of plant and animal nuclear genomes, yet it remains the least-characterized genome component in most species studied so far. Although the recent availability of high-throughput sequencing data provides necessary resources for in-depth investigation of genomic repeats, its utility is hampered by the lack of specialized bioinformatics tools and ap...

Genes and networks regulating root anatomy and architecture.

The root is an excellent model for studying developmental processes that underlie plant anatomy and architecture. Its modular structure, the lack of cell movement and relative accessibility to microscopic visualization facilitate research in a number of areas of plant biology. In this review, we describe several examples that demonstrate how cell type-specific developmental mechanisms determine...

Grains of knowledge: genomics of model cereals.

The economic and scientific importance of the cereals has motivated a rich history of research into their genetics, development, and evolution. The nearly completed sequence of the rice genome is emblematic of a transition to high-throughput genomics and computational biology that has also pervaded study of many other cereals. The relatively close (ca. <50 million years old) relationships among...

A global representation of the carbohydrate structures: a tool for the analysis of glycan.

Glycan resources have been developed of late, such as carbohydrate databases, analysis tools, and algorithms for analysis of carbohydrate features. With this background, bioinformatics approaches to carbohydrate research have recently begun using a large amount of protein and carbohydrate data. This paper introduces one of these projects that elucidates the range of carbohydrate structures. In ...

Publication date: 2003